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Urn a das maneiras de o\har o eficie de produzir 
infermagees socials , ecenemicas e territorials e como arte de 
descrever o munde. Estatfsticas e m apas transporters os fenemen os 
da realldade para escalas apropriadas a perspective de nossa vlsao 
humana e nos permitem pens ar e aglr a distancia, construindo 
avenldas de mao dupla que juntam o mu ndo e su as Imogens, Maier o 
peder de sintese desses representagoes , cembinande, com precisao , 
elementes disperses e heteregenees de cetidiene, meier e nesse 
cenhecimen+e e a nossa capecidade de eempreender e trensfermer a 
realldade. 

Viste cemo arte, o eficie de preduzir essas infermagees 
reflete a culture de um Pais e de sue epeca, ceme essa cultura ve e 
munde e e terna visivel, redefininde e que ve e e que ha para se ver. 

No cenarlo de centfnua inevagae tecnelegica e mudanga 
de culturas da sociedade centemperanea, as nevas tecnelegias de 
infermagae - reuninde cempu+aderes, telecemunicagees e redes de 
infermagde - aceleram aquele mevimente de mebilizagde de munde 
real. Aumenta a velecidade da acumulagae de infermagde e sae 
ampliades seus requisites de atuaiizagde, fermate - mais flexivel, 
persenalizade e interative - e, principalmente, de acessibilidade. A 
plataferma digital vem se consolldando como o melo mais simples, 
barate e pederese para tratar a informagae, ternande pessfveis 
neves predutas e serviges e cenquistande neves usuaries. 

Acreditames ser e ambiente de cenversa e centreversia 
e de treca entre as diferentes disciplines , nas mesas redondas e 
sessoes tematicas das Cenferencias Nacienais de (Seegrafia, 
Cartegrafia e Estatistica e do Simpesie de Inevagees, aquele que 
melher enseja e aprimeramente de censense sebre es fenemenes a 
serem mensurades para retratar a sociedade, a ecenemia e e 
Territerie naclonal e sebre as prieridades e fermates das infermagees 
necessaries para o fortalecimento da cidadania, a definigae de 
peliticas publicas e a gestae pelitice - administrativa de Pais, e para 
criar uma sociedade maisjusta. 
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Abstract: This paper is a presentation oflBGE’s Database, the support systems for data 
extraction and management; issues are discussed approa ching the 
modernization of the database and the improvement of its systems from the 
deployment of new technologies. 

1. IBGE’s Database History 

The history of the information collection comprising IBGE’s Database blends in 
with the data processing history of the institution. The awareness of the importance of the 
construction of information assets stored in magnetic media, accessible through electronic 
data processing equipments, resulted in the collection of most microdata from survey 
developed in the past 33 years, thus comprising the largest database on this country. 

Even previously to the definition of standards and procedures necessary to the 
maintenance of a magnetic storage system of this size, microdata were diligently acquired, 
documented and stored. 

In the 70s, w'lh the coming of age oflBGE’s electronic data processing, procedures 
were implemented aiming at ensuring reliable files, available through the database. 

Later, in the early 80s, standardized documentation methods were employed for the 
description of the collection variables, which afforded the completion of knowledge about 
the file structure stored in the Database. 

Finally, by the late 80s, the Database is acknowledged as a Department within the 
Information Technology area of IBGE, which then formally manages the collection of data 
produced and to develop the products which are used to this date, such as our Metadata 
Base. 



2. Present Situation 

2.1 Services Offered 

Endowed with resources, DEB AD created new products aiming at the management 
of data stored and the processes of information recovery by the end user. We presently have 
Microdata Bases, Theme Data bases and Metadata Bases, besides 1,800 conventional 
microdata files for survey. We also have available interfeces for collection access via 
commercial programming languages such as COBOL and PL/1, via tools developed at 
IBGE such as ATLAS, and also via applications used by specialists in statistics such as 




SAS. A user with mainframe access can, today, handle directly all information of his 
interest, without the interference of DEBAD’s technicians. 

For application programmers, the integration of Metadata Base with ATLAS and 
CREPTA compilers simplifies the handling of conventional files. A tool was developed 
which extracts from that bank the description of dictionaries in a format which is compatible 
with that used by those languages. 

For the users of information in SAS environments, a tool was developed for the 
selection of variables, interactively using the descriptions existing in the Metadata Base. The 
tool generates all definitions and commands necessary for handling the microdata and value 
extraction files. 

For the Microdata users, the interfeces developed by DEB AD allow the utilization 
of SAS in a batch or on line environment, extracting variables in a very user-friendly 
manner. The definition of attributes and their relationships is done smartly by the interface in 
a transparent manner for the user, who analyses the data model and defines the procedures 
for the execution of the task. 

For internal use, including the sectors in charge with the dissemination of data at 
IBGE, tools are made available for the generation of descriptive catalogs for the 
information existing in the Metadata Base. In an interactive manner, the user is aided by 
menus to obtain the information sought and, in finding them, generate the catalog. 

2.2 Problems Encountered 

Despite the good service coverage offered by DEB AD, the operational environment 
imposes some restrictions. The exclusive utilization of proprietary methods and protocols 
restricts the utilization of the services to foe internal public and a few data analyses centers 
connected to IBGE’s SNA network. 

The utilizatio n of the data can also be seen as a restrictive dement against access to 
information. Excepting for the survey which was organized in Theme Data Bases, the 
incorporation of data to the collection is done according to the structure used in the survey 
production process and not according to the needs for dissemination and information 
analyses processes. 

The documentation used for the implementation of the Metadata Base was that 
existing in development manuals of survey systems, according to DEBAD’s files. In many 
cases, survey producers did not make foe necessary review for the validation of this 
information, and have not yet complemented them with their semantic description. 

Despite being a concern of internal bodies, there is not yet a definition of policies for 
the democratization of access to microdata which will consider the commercial aspects as 
well as technical and legal issues of the decharacterization of information. 

Finally, the technological innovations from graphics techniques and languages for 
the programming of computer systems, as well as the communications facilities afforded by 
Internet must be incorporated to the methods and processes used in the production of our 
work. 



3. The Metainformatiion System 

DEB AD has bear attempting to systematize the statistical metainformation 
recording process in its data management system for over 15 years. Initially all efforts were 
directed to the identification of the constitution elements of statistical metadata and the 
definition of operating standards and procedures for its acquisition. 




As Yves Franchet, General Director of EUROSTAT [FRAN93], a piece of 
information has no meaning in itself. Its relevance is restricted to a specific context, usually 
well known by the information producers, however unknown to the majority of users. 

3.1 Metadata Base 

In 1986, a database project was initiated, which was implemented in 1989 under the 
name Metadata Base. This base was implemented using the Database Management System, 
IDMS, basically contemplating statistical data, to the extent that these were the majority of 
the collection of the database. 

The manner of interaction with the Metadata Base is quite limited. Its interface was 
built rising the technological interfaces which were available at the time, resulting in an 
interface based on commands executed from ranked menus. Two systems are made 
available by the interface: an updating system oriented to the maintenance of database 
information; and a inquiry system oriented to the users ofIBGE’s SNA network. 

The Metadata Base allows the record of: 

• survey according to producing areas; 

• files produced in the marry occurrences of each survey; 

• the description of the data dictionary of each file; 

• the classifications and code categories used; and 

• the critique plan for survey using CREPTA. 

An indexation system for the database variables was developed and partially 
implemented, aiming at the definition of a documentary language for the construction rf 
IBGE’s thesaurus and tools for information search and retrieval 

3.2 The MetaBD Project 

The Metadata Base review work - MetaBD - is underway, within IBGE’s Database 
Revamping and Reformulation project. The MetaBD will add to metadata descriptions of 
information originating from IBGE’s geoscientific area: geography, territory structures, 
cartography, geodesy, natural resources and environmental studies. Besides, information are 
also contemplated about IBGE’s products available in other media (other than magnetic). 
Thus, MetaBD will reference myriad institutional products such as, for example, statistical 
yearbooks and maps. MetaBD is being developed in a relational environment, using SGDB 
ORACLE, in an open platform, in a client-server architecture. It can be accessed through 
Internet, adding new user categories to the services provided by DEBAD in 
metainformation. 

Whereas the pioneer Metadata Base freed the challenge of organizing a large data 
volume, MetaBD prime objective wifi be to make IBGE more visible to the public at large. 
With MetaBD external users (experts or otherwise) may investigate the availability of data 
in the collection, from inquiries made in the respective metadata. Besides the existence of 
information, it will be possible to obtain its semantic description, its storage structure and 
the instructions for data extraction from the many diss eminati on systems and the analysis of 
the information held by IBGE. 

3.3 The Integrator Database (BDI) 

The new IT environment proposed for the organization and access to data at IBGE, 
to be managed and controlled by DI, wifi be in fret composed by two main databases: one 




Metadata base (MetaBD) and the Integrator Database (BDI). While MetaBD contains 
reference and descriptive information relating to data in the collection, BDI will contain all 
remaining information required for integrated access to the many databases, these being 
under the control of a Database Management System or otherwise. 

3.3. 1 Organizing IBGE’s Data Collection 

In practice, BDI will represent the overall scheme of IBGE’s data collection, today 
organized in many management systems and conventional files in different computing 
platforms. This organization is proposed bearing in mind heterogeneous database systems, 
today a reality and technological requirement. 

The different systems and manners of data organizing must coexist in the most 
integrated manner, because that is a natural process in large organizations such as IBGE. 
These multiple systems and data are operated and managed under local, stand-alone rules, 
with data models, languages and many representations, often for similar information. Thus, 
the existence of another level of data abstraction is justifiable, providing access to the many 
conventional systems and files, without need for changes and adaptations to existing 
systems. 

3.3.2 Access to Data in the Collection 

The many applications which require data access may do so using responses to BDI 
inquiries (and also to MetaBD), which will serve as a generic interface between application 
programs and the data collection. This will be valid for applications requiring microdata as 
well for agg regated data and products in general. BDI will operate as a data integrating 
metabase, acting as the sole pathway for conventional files and somehow complementing 
the data dictionaries of existing DBMSs. 

3.3.3 A Uniform View of Data 

It is also expected that BDI wifi act as a large data depository, speeding and 
facilitating access to them, independently of platform and models of origin 

One of the main functions ofBDI wifi be to provide managers, developers and users 
in general with a uniform view of different data and databases. 

3.4 Outlooks 

DEB AD is pasting through an internal discussion process regarding the organization 
of the existing data collection and the incorporation of new data to the same collection. A 
fruit of IBGE’s Database Revamping and Reformulation process, it is expected, among 
other things, that: 

• the migration from the present to the new situation processes without major 
hurdles, because present systems and data collections will be held operational 
until new releases are ready to come on line; 

• new access interfaces will afford both the dissemination of IBGE’s products to 
the public at large (Internet) as well as a greater integration of those responsible 
for the products with the ones responsible for the storage of respective 
information and, still, with those who will make information available (Intranet); 




• IBGE’s product development process be endowed with tools and techniques to 
ensure the consistency of information and make feasible the internal and external 
access to the base with a uniform view of data. 

The first phase of the project - MetaBD definition and implementation - is coming to 
a close, while other access interfeces, definition and implementation of BDI and the 
development of application programs for the new environment are being specified. 

4. Incorporation of Survey to a Shared Information Processing Environment 

Theoretically, every piece of information produced at EBGE is a candidates for 
incorporation into the Database collection. We understand that data must be incorporated 
to the collection as soon as critiqued and ready for tabulation work. At this stage, foe 
specialists in data management, together with those from the production and dissemination 
areas, must adopt a data structure which aims at facilitating data handling fo information 
analysis and dissemination processes. 

A production system, such as foe ‘96 Agricultural and Livestock Census, which 
considers a shared base at the state level, may show a greater efficiency in foe production 
processes and even for state analysis. However, in order to be used in foe state scope and 
satisfy the most diverse information demand profiles, databases such as these must be 
reviewed, aiming at facilitating data dissemination and statistical analysis. 

As in other data incorporation processes, shared bases must also be foe object of 
incorporation to foe data collection and documentation in IBGE’s metainformation system. 
DEB AD must be foe body responsible for foe management of data distribution and 
replication. Together with the producing and disseminating bodies, it must be responsible 
for foe definition of fragmentation and replication processes necessary to maintain 
availability, integrity and efficiency in foe handling of shared databases. 

5 Utilization of International Standards 

The advent of international communication networks, such as Internet, highlights foe 
importance of the adoption of international standards in order to make feasible the 
electronic exchange of information. 

The physical feasibility of these services depends on the availability of foe 
communications media (network) which is being implemented at IBGE, and on an efficient 
collection of metadata. It also depends on the construction of access interfaces, of manners 
of data organization and semantic descriptions winch are internationally recognizable. Thus, 
even if the users farthest from the information production processes may access the 
database for their information analysis, data extraction or information exchange. 

6. Data Management Policy 

It is important for IBGE that the data management policy be decentralized, which 
does not imply in sharing MetaBD. The volume of metadata does jot justify sharing, which 
would add critical factors such as the definition of sharing criteria, assurance of consistency 
of copies, difficulties in controlling access, etc. 

The Data Management Policy to be adopted must have management support starting 
with senior management, establishing standards and methodologies for foe production of 
new data. 




DEB AD wishes to assume responsibility for the collection, however this will require 
that survey developing agencies respect incorporation standards and patters, ensuring the 
consistence of information from collection through dissemination. 

7. Conclusions 

As indicated, DEB AD has a relevant role to play in the information production as 
well as in the dissemination processes. It acts as a body specialized in providing advice in 
the deployment of techniques for databases and as an executing body vis-a-vis database 
organization, collection build-up and information dissemination tasks. 

DEB AD may make feasible the access to microdata bases to the external public, as 
for IPEA, as long as IBGE defines a commercial and information confidentiality policy, as 
done in some countries such as Norway. 

MetaBD may become a significant inquiry source to the entire collection of data 
produced at IBGE. IBGE may respond on the availability of foe information, production 
processes adopted and how and when to obtain it 

Finally, it is well to emphasize the importance of foe participation of experts 
responsible for foe production of statistical and geosciendfic information in foe data 
management and metadata definition at the Institution. Only thus will we be able to expand 
foe visibility ofIBGE’s Database collection to every user. 




GLOSSARY 



COBOL Common Business Oriented Language - A high-level programming 
language oriented to commercial applications. 

PL/1 Programming Language One - A high-level programming language 

oriented to commercial and scientific applications. 

ATLAS A language for the development of critique, inputting and tabulation 

programs, developed at IBGE 

CREPT A A language for the specification of automatic critique and inputting 

programs developed at IBGE. 

SAS Statistical Analysis Systems - A high-level programming language 

oriented to the exploration of statistical dam. 

SNA Systems Networking Architecture • A communications network 

protocol, 

IDMS Integrated Database Management System - A database manager 

using the network approach. 

ORACLE A database manager using a relational approach. 
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